(1) GENERAL INFORMATION: 

(i) APPLICANT: Chatterjee, Deb K. 

(ii) TITLE OF INVENTION: Cloned DNA Polymerases from Thermotoga 
maritima and Mutants Thereof 

(iii) NUMBER OF SEQUENCES: 47 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: STERNE, KESSLER, GOLDSTEIN & FOX, P.L.L.C. 

(B) STREET: 1100 New York Ave., N.W., Suite 600 

(C) CITY: Washington 

(D) STATE: DC 

(E) COUNTRY: USA 

(F) ZIP: 20005-3934 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09/229,173 

(B) FILING DATE: 13-JAN-1999 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/706,702 

(B) FILING DATE: 06-SEP-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/689,807 

(B) FILING DATE: 14-AUG-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/537,400 

(B) FILING DATE: 02-OCT-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/576,759 

(B) FILING DATE: 21-DEC-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/537,397 

(B) FILING DATE: 02-OCT-1995 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/525,057 

(B) FILING DATE: 08-SEP-1995 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Millonig, Robert C. 

(B) REGISTRATION NUMBER: 34,395 

(C) REFERENCE/DOCKET NUMBER: 0942.2800008 



(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 202-371-2600 

(B) TELEFAX: 202-371-2540 



(2) INFORMATION FOR SEQ ID NO:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

Arg Xaa Xaa Xaa Lys Xaa Xaa Xaa Phe Xaa Xaa Xaa Tyr Xaa 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B) LOCATION: 1..2679 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

ATG GCG AGA CTA TTT CTC TTT GAT GGC ACA GCC CTG GCC TAC AGG CCA 48 
Met Ala Arg Leu Phe Leu Phe Asp Gly Thr Ala Leu Ala Tyr Arg Ala 
1 5 10 15 

Ivl Ifr Sa Z? f C f A 1°° TCC ACA TCC ACA GGA ATT CCA ACG 96 
Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Ser Thr Gly Ile Pro Thr 
20 25 30 

AAC GCC GTC TAT GGC GTT GCC AGG ATG CTC GTT AAA TTC ATT AAG GAA 144 
Asn Ala Val Tyr Gly Val Ala Arg Met Leu Val Lys Phe Ile Lys Glu 
35 40 45 

CAC ATT ATA CCC GAA AAG GAC TAC GCG GCT GTG GCC TTC GAC AAG AAG 192 
His Ile Ile Pro Glu Lys Asp Tyr Ala Ala Val Ala Phe Asp Lys Lys 
50 55 60 

aS M» ll C ? G A u AC CTG CTC GTA AGC GAC ^ G CCG CAA AGG 240 
Ala Ala Thr Phe Arg His Lys Leu Leu Val Ser Asp Lys Ala Gln Arg 
65 70 75 80 

S ?SS n? T CTA GTT CAG CAG CTA CCT TAC ATC ^G CGG 288 
Pro Lys Thr Pro Ala Leu Leu Val Gln Gln Leu Pro Tyr Ile Lys Arg 
85 90 95 

CTG ATA GAA GCT CTT GGT TTC AAA GTG CTG GAG CTG GAG GGA TAC GAA 336 
Leu Ile Glu Ala Leu Gly Phe Lys Val Leu Glu Leu Glu Gly Tyr Glu 
100 105 110 

GCA GAC GAT ATC ATC GCC ACG CTT GCA GTC AGG GCT GCA CGT TTT TTG 384 
Ala Asp Asp Ile Ile Ala Thr Leu Ala Val Arg Ala Ala Arg Phe Leu 
115 120 125 

ATG AGA TTT TCA TTA ATA ACC GGT GAC AAG GAT ATG CTT CAA CTT GTA 432 
Met Arg Phe Ser Leu Ile Thr Gly Asp Lys Asp Met Leu Gln Leu Val 
130 135 140 

AAC GAG AAG ATA AAG GTC TGG AGA ATC GTC AAG GGG ATA TCG GAT CTT 480 
Asn Glu Lys Ile Lys Val Trp Arg Ile Val Lys Gly Ile Ser Asp Leu 
145 150 155 160 

GAG CTT TAC GAT TCG AAA AAG GTG AAA GAA AGA TAC GGT GTG GAA CCA 528 
Glu Leu Tyr Asp Ser Lys Lys Val Lys Glu Arg Tyr Gly Val Glu Pro 
165 170 175 

CAT CAG ATA CCG GAT CTT CTA GCA CTG ACG GGA GAC GAC ATA GAC AAC 576 
His Gln Ile Pro Asp Leu Leu Ala Leu Thr Gly Asp Asp Ile Asp Asn 
180 185 190 

ATT CCC GGT GTA ACG GGA ATA GGT GAA AAG ACC GCT GTA CAG CTT CTC 624 
Ile Pro Gly Val Thr Gly Ile Gly Glu Lys Thr Ala Val Gln Leu Leu 
195 200 205 

GGC AAG TAT AGA AAT CTT GAA TAC ATT CTG GAG CAT GCC CGT GAA CTC 672 
Gly Lys Tyr Arg Asn Leu Glu Tyr Ile Leu Glu His Ala Arg Glu Leu 
210 215 220 

CCC CAG AGA GTG AGA AAG GCT CTC TTG AGA GAC AGG GAA GTT GCC ATC 720 
Pro Gln Arg Val Arg Lys Ala Leu Leu Arg Asp Arg Glu Val Ala Ile 
225 230 235 240 

CTC AGT AAA AAA CTT GCA ACT CTG GTG ACG AAC GCA CCT GTT GAA GTG 768 
Leu Ser Lys Lys Leu Ala Thr Leu Val Thr Asn Ala Pro Val Glu Val 
245 250 255 

GAC TGG GAA GAG ATG AAA TAC AGA GGA TAC GAC AAG AGA AAA CTA CTT 816 
Asp Trp Glu Glu Met Lys Tyr Arg Gly Tyr Asp Lys Arg Lys Leu Leu 
260 265 270 

CCG ATA TTG AAA GAA CTG GAG TTT GCT TCC ATC ATG AAG GAA CTT CAA 864 
Pro Ile Leu Lys Glu Leu Glu Phe Ala Ser Ile Met Lys Glu Leu Gln 
275 280 285 

CTG TAC GAA GAA GCA GAA CCC ACC GGA TAC GAA ATC GTG AAG GAT CAT 912 
Leu Tyr Glu Glu Ala Glu Pro Thr Gly Tyr Glu Ile Val Lys Asp His 
290 295 300 

AAG ACC TTC GAA GAT CTC ATC GAA AAG CTG AAG GAG GTT CCA TCT TTT 960 
Lys Thr Phe Glu Asp Leu Ile Glu Lys Leu Lys Glu Val Pro Ser Phe 
305 310 315 320 

GCC CTG GAC CTT GAA ACG TCC TCC TTG GAC CCG TTC AAC TGT GAG ATA 1008 
Ala Leu Asp Leu Glu Thr Ser Ser Leu Asp Pro Phe Asn Cys Glu Ile 
325 330 335 

GTC GGC ATC TCC GTG TCG TTC AAA CCG AAA ACA GCT TAT TAC ATT CCA 1056 
Val Gly Ile Ser Val Ser Phe Lys Pro Lys Thr Ala Tyr Tyr Ile Pro 
340 345 350 

CTT CAT CAC AGA AAC GCC CAC AAT CTT GAT GAA ACA CTG GTG CTG TCG 1104 
Leu His His Arg Asn Ala His Asn Leu Asp Glu Thr Leu Val Leu Ser 
355 360 365 

AAG TTG AAA GAG ATC CTC GAA GAC CCG TCT TCG AAG ATT GTG GGT CAG 1152 
Lys Leu Lys Glu Ile Leu Glu Asp Pro Ser Ser Lys Ile Val Gly Gln 
370 375 380 

AAC CTG AAG TAC GAC TAC AAG GTT CTT ATG GTA AAG GGT ATA TCG CCA 1200 
Asn Leu Lys Tyr Asp Tyr Lys Val Leu Met Val Lys Gly Ile Ser Pro 
385 390 395 400 

GTT TAT CCG CAT TTT GAC ACG ATG ATA GCT GCA TAT TTG CTG GAG CCA 1248 
Val Tyr Pro His Phe Asp Thr Met Ile Ala Ala Tyr Leu Leu Glu Pro 
405 410 415 

AAC GAG AAA AAA TTC AAT CTC GAA GAT CTG TCT TTG AAA TTT CTC GGA 1296 
Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser Leu Lys Phe Leu Gly 
420 425 430 

TAC AAA ATG ACG TCT TAT CAG GAA CTG ATG TCG TTT TCC TCA CCA CTT 1344 
Tyr Lys Met Thr Ser Tyr Gln Glu Leu Met Ser Phe Ser Ser Pro Leu 
435 440 445 

TTT GGT TTC AGC TTT GCG GAT GTT CCG GTA GAC AAG GCT GCC GAA TAC 1392 
Phe Gly Phe Ser Phe Ala Asp Val Pro Val Asp Lys Ala Ala Glu Tyr 
450 455 460 

TCC TGC GAG GAT GCA GAC ATC ACT TAT AGG CTC TAC AAG ATA CTC AGC 1440 
Ser Cys Glu Asp Ala Asp Ile Thr Tyr Arg Leu Tyr Lys Ile Leu Ser 
465 470 475 480 

ATG AAG CTC CAT GAA GCG GAA CTT GAG AAC GTC TTC TAC AGG ATA GAG 1488 
Met Lys Leu His Glu Ala Glu Leu Glu Asn Val Phe Tyr Arg Ile Glu 
485 490 495 

ATG CCG TTG GTG AAC GTC TTG GCA CGA ATG GAA TTC AAC TGG GTG TAT 1536 
Met Pro Leu Val Asn Val Leu Ala Arg Met Glu Phe Asn Trp Val Tyr 
500 505 510 

vll 2p G?S HI T CTG ?** TCG GAG GAG TAC GGC AAG 1584 
Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu Glu Tyr Gly Lys Lys 
515 520 525 

CTC GAG GAA CTG GCC GAA AAA ATC TAC CAG ATA GCA GGT GAG CCC TTC 1632 
Leu Glu Glu Leu Ala Glu Lys Ile Tyr Gln Ile Ala Gly Glu Pro Phe 
530 535 540 

AAC ATC AAT TCT CCA AAA CAG GTT TCA AAC ATC CTT TTT GAG AAG CTG 1680 
Asn Ile Asn Ser Pro Lys Gln Val Ser Asn Ile Leu Phe Glu Lys Leu 
545 550 555 560 

GGA ATA AAA CCC CGT GGA AAA ACG ACA AAA ACA GGA GAT TAC TCT ACC 1728 
Gly Ile Lys Pro Arg Gly Lys Thr Thr Lys Thr Gly Asp Tyr Ser Thr 
565 570 575 

AGG ATA GAG GTG TTG GAA GAG ATA GCG AAT GAG CAC GAG ATA GTA CCC 1776 
Arg Ile Glu Val Leu Glu Glu Ile Ala Asn Glu His Glu Ile Val Pro 
580 585 590 

CTC ATT CTC GAG TTC AGA AAG ATC CTG AAA CTG AAA TCG ACC TAC ATA 1824 
Leu Ile Leu Glu Phe Arg Lys Ile Leu Lys Leu Lys Ser Thr Tyr Ile 
595 600 605 

GAC ACC CTT CCG AAA CTT GTG AAC CCG AAA ACC GGA AGA TTT CAT GCA 1872 
Asp Thr Leu Pro Lys Leu Val Asn Pro Lys Thr Gly Arg Phe His Ala 
610 615 620 

TCT TTC CAC CAG ACG GGT ACC GCC ACT GGC AGG TTG AGT AGC AGT GAT 1920 
Ser Phe His Gln Thr Gly Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 
625 630 635 640 

CCA AAT CTT CAG AAT CTT CCG ACA AAG AGC GAA GAG GGA AAA GAA ATT 1968 
Pro Asn Leu Gln Asn Leu Pro Thr Lys Ser Glu Glu Gly Lys Glu Ile 
645 650 655 

AGA AAA GCG ATT GTG CCC CAG GAT CCA GAC TGG TGG ATC GTC AGT GCG 2016 
Arg Lys Ala Ile Val Pro Gln Asp Pro Asp Trp Trp Ile Val Ser Ala 
660 665 670 

GAT TAT TCC CAA ATA GAA CTC AGA ATC CTC GCT CAT CTC AGT GGT GAT 2064 
Asp Tyr Ser Gln Ile Glu Leu Arg Ile Leu Ala His Leu Ser Gly Asp 
675 680 685 

GAG AAC CTT GTG AAG GCC TTC GAG GAG GGC ATC GAT GTG CAC ACC TTG 2112 
Glu Asn Leu Val Lys Ala Phe Glu Glu Gly Ile Asp Val His Thr Leu 
690 695 700 

ACT GCC TCC AGG ATC TAC AAC GTA AAG CCA GAA GAA GTG AAC GAA GAA 2160 
Thr Ala Ser Arg Ile Tyr Asn Val Lys Pro Glu Glu Val Asn Glu Glu 
705 710 715 720 

ATG CGA CGG GTT GGA AAG ATG GTG AAC TTC TCT ATA ATA TAC GGT GTC 2208 
Met Arg Arg Val Gly Lys Met Val Asn Phe Ser Ile Ile Tyr Gly Val 
725 730 735 

ACA CCG TAC GGT CTT TCT GTG AGA CTT GGA ATA CCG GTT AAA GAA GCA 2256 
Thr Pro Tyr Gly Leu Ser Val Arg Leu Gly Ile Pro Val Lys Glu Ala 
740 745 750 

GAA AAG ATG ATT ATC AGC TAT TTC ACA CTG TAT CCA AAG GTG CGA AGC 2304 
Glu Lys Met Ile Ile Ser Tyr Phe Thr Leu Tyr Pro Lys Val Arg Ser 
755 760 765 

TAC ATC CAG CAG GTT GTT GCA GAG GCA AAA GAG AAG GGC TAC GTC AGG 2352 
Tyr Ile Gln Gln Val Val Ala Glu Ala Lys Glu Lys Gly Tyr Val Arg 
770 775 780 

ACT CTC TTT GGA AGA AAA AGA GAT ATT CCC CAG CTC ATG GCA AGG GAC 2400 
Thr Leu Phe Gly Arg Lys Arg Asp Ile Pro Gln Leu Met Ala Arg Asp 
785 790 795 800 

AAG AAC ACC CAG TCC GAA GGC GAA AGA ATC GCG ATA AAC ACC CCC ATT 2448 
Lys Asn Thr Gln Ser Glu Gly Glu Arg Ile Ala Ile Asn Thr Pro Ile 
805 810 815 

CAG GGA ACT GCG GCA GAT ATA ATA AAA TTG GCT ATG ATA GAT ATA GAC 2496 
Gln Gly Thr Ala Ala Asp Ile Ile Lys Leu Ala Met Ile Asp Ile Asp 
820 825 830 

GAG GAG CTG AGA AAA AGA AAC ATG AAA TCC AGA ATG ATC ATT CAG GTT 2544 
Glu Glu Leu Arg Lys Arg Asn Met Lys Ser Arg Met Ile Ile Gln Val 
835 840 845 

CAT GAC GAA CTG GTC TTC GAG GTT CCC GAT GAG GAA AAA GAA GAA CTA 2592 
His Asp Glu Leu Val Phe Glu Val Pro Asp Glu Glu Lys Glu Glu Leu 
850 855 860 

GTT GAT CTG GTG AAG AAC AAA ATG ACA AAT GTG GTG AAA CTC TCT GTG 2640 
Val Asp Leu Val Lys Asn Lys Met Thr Asn Val Val Lys Leu Ser Val 
865 870 875 880 

CCT CTT GAG GTT GAC ATA AGC ATC GGA AAA AGC TGG TCT TGA 2682 
Pro Leu Glu Val Asp Ile Ser Ile Gly Lys Ser Trp Ser 
885 890 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 893 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Ala Arg Leu Phe Leu Phe Asp Gly Thr Ala Leu Ala Tyr Arg Ala 
1 5 10 15 

Tyr Tyr Ala Leu Asp Arg Ser Leu Ser Thr Ser Thr Gly Ile Pro Thr 
20 25 30 

Asn Ala Val Tyr Gly Val Ala Arg Met Leu Val Lys Phe Ile Lys Glu 
35 40 45 

His Ile Ile Pro Glu Lys Asp Tyr Ala Ala Val Ala Phe Asp Lys Lys 
50 55 60 



Ala Ala Thr Phe Arg His Lys Leu Leu Val Ser Asp Lys Ala Gln Arg 
65 70 75 80 

Pro Lys Thr Pro Ala Leu Leu Val Gln Gln Leu Pro Tyr Ile Lys Arg 
85 90 95 

Leu Ile Glu Ala Leu Gly Phe Lys Val Leu Glu Leu Glu Gly Tyr Glu 
100 105 110 

Ala Asp Asp Ile Ile Ala Thr Leu Ala Val Arg Ala Ala Arg Phe Leu 
115 120 125 

Met Arg Phe Ser Leu Ile Thr Gly Asp Lys Asp Met Leu Gln Leu Val 
130 135 140 

Asn Glu Lys Ile Lys Val Trp Arg Ile Val Lys Gly Ile Ser Asp Leu 
145 150 155 160 

Glu Leu Tyr Asp Ser Lys Lys Val Lys Glu Arg Tyr Gly Val Glu Pro 
165 170 175 

His Gln Ile Pro Asp Leu Leu Ala Leu Thr Gly Asp Asp Ile Asp Asn 
180 185 190 

Ile Pro Gly Val Thr Gly Ile Gly Glu Lys Thr Ala Val Gln Leu Leu 
195 200 205 

Gly Lys Tyr Arg Asn Leu Glu Tyr Ile Leu Glu His Ala Arg Glu Leu 
210 215 220 

Pro Gln Arg Val Arg Lys Ala Leu Leu Arg Asp Arg Glu Val Ala Ile 
225 230 235 240 

Leu Ser Lys Lys Leu Ala Thr Leu Val Thr Asn Ala Pro Val Glu Val 
245 250 255 

Asp Trp Glu Glu Met Lys Tyr Arg Gly Tyr Asp Lys Arg Lys Leu Leu 
260 265 270 

Pro Ile Leu Lys Glu Leu Glu Phe Ala Ser Ile Met Lys Glu Leu Gln 
275 280 285 

Leu Tyr Glu Glu Ala Glu Pro Thr Gly Tyr Glu Ile Val Lys Asp His 
290 295 300 

Lys Thr Phe Glu Asp Leu Ile Glu Lys Leu Lys Glu Val Pro Ser Phe 
305 310 315 320 

Ala Leu Asp Leu Glu Thr Ser Ser Leu Asp Pro Phe Asn Cys Glu Ile 
325 330 335 

Val Gly Ile Ser Val Ser Phe Lys Pro Lys Thr Ala Tyr Tyr Ile Pro 
340 345 350 

Leu His His Arg Asn Ala His Asn Leu Asp Glu Thr Leu Val Leu Ser 
355 360 365 

Lys Leu Lys Glu Ile Leu Glu Asp Pro Ser Ser Lys Ile Val Gly Gln 
370 375 380 

Asn Leu Lys Tyr Asp Tyr Lys Val Leu Met Val Lys Gly Ile Ser Pro 
385 390 395 400 

Val Tyr Pro His Phe Asp Thr Met Ile Ala Ala Tyr Leu Leu Glu Pro 
405 410 415 

Asn Glu Lys Lys Phe Asn Leu Glu Asp Leu Ser Leu Lys Phe Leu Gly 
420 425 430 

Tyr Lys Met Thr Ser Tyr Gln Glu Leu Met Ser Phe Ser Ser Pro Leu 
435 440 445 

Phe Gly Phe Ser Phe Ala Asp Val Pro Val Asp Lys Ala Ala Glu Tyr 
450 455 460 

Ser Cys Glu Asp Ala Asp Ile Thr Tyr Arg Leu Tyr Lys Ile Leu Ser 
465 470 475 480 

Met Lys Leu His Glu Ala Glu Leu Glu Asn Val Phe Tyr Arg Ile Glu 
485 490 495 

Met Pro Leu Val Asn Val Leu Ala Arg Met Glu Phe Asn Trp Val Tyr 
500 505 510 

Val Asp Thr Glu Phe Leu Lys Lys Leu Ser Glu Glu Tyr Gly Lys Lys 
515 520 525 

Leu Glu Glu Leu Ala Glu Lys Ile Tyr Gln Ile Ala Gly Glu Pro Phe 
530 535 540 

Asn Ile Asn Ser Pro Lys Gln Val Ser Asn Ile Leu Phe Glu Lys Leu 
545 550 555 560 

Gly Ile Lys Pro Arg Gly Lys Thr Thr Lys Thr Gly Asp Tyr Ser Thr 
565 570 575 

Arg Ile Glu Val Leu Glu Glu Ile Ala Asn Glu His Glu Ile Val Pro 
580 585 590 

Leu Ile Leu Glu Phe Arg Lys Ile Leu Lys Leu Lys Ser Thr Tyr Ile 
595 600 605 

Asp Thr Leu Pro Lys Leu Val Asn Pro Lys Thr Gly Arg Phe His Ala 
610 615 620 

Ser Phe His Gln Thr Gly Thr Ala Thr Gly Arg Leu Ser Ser Ser Asp 
625 630 635 640 

Pro Asn Leu Gln Asn Leu Pro Thr Lys Ser Glu Glu Gly Lys Glu Ile 
645 650 655 

Arg Lys Ala Ile Val Pro Gln Asp Pro Asp Trp Trp Ile Val Ser Ala 
660 665 670 

Asp Tyr Ser Gln Ile Glu Leu Arg Ile Leu Ala His Leu Ser Gly Asp 
675 680 685 

Glu Asn Leu Val Lys Ala Phe Glu Glu Gly Ile Asp Val His Thr Leu 
690 695 700 

Thr Ala Ser Arg Ile Tyr Asn Val Lys Pro Glu Glu Val Asn Glu Glu 
705 710 715 720 

Met Arg Arg Val Gly Lys Met Val Asn Phe Ser Ile Ile Tyr Gly Val 
725 730 735 

Thr Pro Tyr Gly Leu Ser Val Arg Leu Gly Ile Pro Val Lys Glu Ala 
740 745 750 

Glu Lys Met Ile Ile Ser Tyr Phe Thr Leu Tyr Pro Lys Val Arg Ser 
755 760 765 

Tyr Ile Gln Gln Val Val Ala Glu Ala Lys Glu Lys Gly Tyr Val Arg 
770 775 780 

Thr Leu Phe Gly Arg Lys Arg Asp Ile Pro Gln Leu Met Ala Arg Asp 
785 790 795 800 

Lys Asn Thr Gln Ser Glu Gly Glu Arg Ile Ala Ile Asn Thr Pro Ile 
805 810 815 

Gln Gly Thr Ala Ala Asp Ile Ile Lys Leu Ala Met Ile Asp Ile Asp 
820 825 830 

Glu Glu Leu Arg Lys Arg Asn Met Lys Ser Arg Met Ile Ile Gln Val 
835 840 845 

His Asp Glu Leu Val Phe Glu Val Pro Asp Glu Glu Lys Glu Glu Leu 
850 855 860 

Val Asp Leu Val Lys Asn Lys Met Thr Asn Val Val Lys Leu Ser Val 
865 870 875 880 

Pro Leu Glu Val Asp Ile Ser Ile Gly Lys Ser Trp Ser 
885 890 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Pro Ser Phe Ala Leu Asp Leu Glu Thr Ser Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Pro Val Phe Ala Phe Asp Thr Glu Thr Asp Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Gly Pro Val Ala Phe Asp Ser Glu Thr Ser Ala 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Ile Val Ser Asp Ile Glu Ala Asn Ala 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GACGTTTCAA GCGCTAGGGC AAAAGA 

(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Arg Arg Val Gly Lys Met Val Asn Phe Ser Ile Ile Tyr Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 







(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Arg Arg Ser Ala Lys Ala Ile Asn Phe Gly Leu Ile Tyr Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Arg Gln Ala Ala Lys Ala Ile Thr Phe Gly Ile Leu Tyr Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Arg Asp Asn Ala Lys Thr Phe Ile Tyr Gly Phe Leu Tyr Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Arg Arg Ala Ala Lys Thr Ile Asn Phe Gly Val Leu Tyr Gly 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GTATATTATA GAGTAGTTAA CCATCTTTCC A 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GTAGGCCAGG GCTGTGCCGG CAAAGAGAAA TAGTC 
(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GAAGCATATC CTTGGCGCCG GTTATTATGA AAATC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 694 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B) LOCATION: 2..691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

G GAT CCA GAC TGG TGG ATC GTC AGT GCG GAT TAT TCC CAA ATA GAA 46 
Asp Pro Asp Trp Trp Ile Val Ser Ala Asp Tyr Ser Gln Ile Glu 
895 900 905 

CTC AGA ATC CTC GCT CAT CTC AGT GGT GAT GAG AAC CTT GTG AAG GCC 94 
Leu Arg Ile Leu Ala His Leu Ser Gly Asp Glu Asn Leu Val Lys Ala 
910 915 920 

TTC GAG GAG GGC ATC GAT GTG CAC ACC TTG ACT GCC TCC AGG ATC TAC 142 
Phe Glu Glu Gly Ile Asp Val His Thr Leu Thr Ala Ser Arg Ile Tyr 
925 930 935 940 



AAC GTA AAG CCA GAA GAA GTG AAC GAA GAA ATG CGA CGG GTT GGA AAG 190 
Asn Val Lys Pro Glu Glu Val Asn Glu Glu Met Arg Arg Val Gly Lys 
945 950 955 

ATG GTG AAC TTC TCT ATA ATA TAC GGT GTC ACA CCG TAC GGT CTT TCT 238 
Met Val Asn Phe Ser Ile Ile Tyr Gly Val Thr Pro Tyr Gly Leu Ser 
960 965 970 

GTG AGA CTT GGA ATA CCG GTT AAA GAA GCA GAA AAG ATG ATT ATC AGC 286 
Val Arg Leu Gly Ile Pro Val Lys Glu Ala Glu Lys Met Ile Ile Ser 
975 980 985 

TAT TTC ACA CTG TAT CCA AAG GTG CGA AGC TAC ATC CAG CAG GTT GTT 334 
Tyr Phe Thr Leu Tyr Pro Lys Val Arg Ser Tyr Ile Gln Gln Val Val 
990 995 1000 

GCA GAG GCA AAA GAG AAG GGC TAC GTC AGG ACT CTC TTT GGA AGA AAA 382 
Ala Glu Ala Lys Glu Lys Gly Tyr Val Arg Thr Leu Phe Gly Arg Lys 
1005 1010 1015 1020 

AGA GAT ATT CCC CAG CTC ATG GCA AGG GAC AAG AAC ACC CAG TCC GAA 430 
Arg Asp Ile Pro Gln Leu Met Ala Arg Asp Lys Asn Thr Gln Ser Glu 
1025 1030 1035 

GGC GAA AGA ATC GCG ATA AAC ACC CCC ATT CAG GGA ACT GCG GCA GAT 478 
Gly Glu Arg Ile Ala Ile Asn Thr Pro Ile Gln Gly Thr Ala Ala Asp 
1040 1045 1050 

ATA ATA AAA TTG GCT ATG ATA GAT ATA GAC GAG GAG CTG AGA AAA AGA 526 
Ile Ile Lys Leu Ala Met Ile Asp Ile Asp Glu Glu Leu Arg Lys Arg 
1055 1060 1065 

AAC ATG AAA TCC AGA ATG ATC ATT CAG GTT CAT GAC GAA CTG GTC TTC 574 
Asn Met Lys Ser Arg Met Ile Ile Gln Val His Asp Glu Leu Val Phe 
1070 1075 1080 

GAG GTT CCC GAT GAG GAA AAA GAA GAA CTA GTT GAT CTG GTG AAG AAC 622 
Glu Val Pro Asp Glu Glu Lys Glu Glu Leu Val Asp Leu Val Lys Asn 
1085 1090 1095 1100 

AAA ATG ACA AAT GTG GTG AAA CTC TCT GTG CCT CTT GAG GTT GAC ATA 670 
Lys Met Thr Asn Val Val Lys Leu Ser Val Pro Leu Glu Val Asp Ile 
1105 1110 1115 

AGC ATC GGA AAA AGC TGG TCT TGA 694 
Ser Ile Gly Lys Ser Trp Ser 
1120 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 230 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Asp Pro Asp Trp Trp Ile Val Ser Ala Asp Tyr Ser Gln Ile Glu Leu 
1 5 10 15 

Arg Ile Leu Ala His Leu Ser Gly Asp Glu Asn Leu Val Lys Ala Phe 
20 25 30 

Glu Glu Gly Ile Asp Val His Thr Leu Thr Ala Ser Arg Ile Tyr Asn 
35 40 45 

Val Lys Pro Glu Glu Val Asn Glu Glu Met Arg Arg Val Gly Lys Met 
50 55 60 

Val Asn Phe Ser Ile Ile Tyr Gly Val Thr Pro Tyr Gly Leu Ser Val 
65 70 75 80 

Arg Leu Gly Ile Pro Val Lys Glu Ala Glu Lys Met Ile Ile Ser Tyr 
85 90 95 

Phe Thr Leu Tyr Pro Lys Val Arg Ser Tyr Ile Gln Gln Val Val Ala 
100 105 110 

Glu Ala Lys Glu Lys Gly Tyr Val Arg Thr Leu Phe Gly Arg Lys Arg 
115 120 125 

Asp Ile Pro Gln Leu Met Ala Arg Asp Lys Asn Thr Gln Ser Glu Gly 
130 135 140 

Glu Arg Ile Ala Ile Asn Thr Pro Ile Gln Gly Thr Ala Ala Asp Ile 
145 150 155 160 

Ile Lys Leu Ala Met Ile Asp Ile Asp Glu Glu Leu Arg Lys Arg Asn 
165 170 175 

Met Lys Ser Arg Met Ile Ile Gln Val His Asp Glu Leu Val Phe Glu 
180 185 190 

Val Pro Asp Glu Glu Lys Glu Glu Leu Val Asp Leu Val Lys Asn Lys 
195 200 205 

Met Thr Asn Val Val Lys Leu Ser Val Pro Leu Glu Val Asp Ile Ser 
210 215 220 

Ile Gly Lys Ser Trp Ser 
225 230 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Phe Leu Phe Asp Gly Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Leu Leu Val Asp Gly His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Ser Leu Ile Thr Gly Asp Lys Asp Met Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Arg Ile Leu Thr Ala Asp Lys Asp Leu Tyr 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 687 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

TCGTACCNGG GNTCNCNANA TCGACTGCAG CATGCAAGCT GGCTAATCAT GGTCATAGCT 60 

GTTTCCTGTG TGAAATTGTT ATCCGCTCAC AATTCCACAC AACATACGAG CCGGAAGCAT 120 

AAAGTGTAAA GCCTGGGGTG CCTAATGAGT GAGCTAACTC ACATTAATTG CGTTGCGCTC 180 

ACTGCCCGCT TTCCAGTCGG GAAACCTGTC GTGCCAGCTG CATTAATGAA TCGGCCAACG 240 

CGCGGGGAGA GGCGGTTTGC GTATTGGGCG CTCTTCCGCT TCCTCGCTCA CTGACTCGCT 300 

GCGCTCGGTC GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC TCAAAGGCGG TAATACGGTT 360 

ATCCACAGAA TCAGGGGATA ACGCAGGAAA G AAC AT G T GAT GCAZOQTGGCC AGCAAAAGGC 420 

CAGGAACCGT TAAAAAGGCC GCGTTGCTGG GCGTTTTTCC ATAGGCTCCG CCCCCCTTGA 480 

CGAGCATCAC AAAAATTCGA CGCTTCAAGT TCAGAGGTGG GCGAAACCCG ACAGGGACTA 540 

TAAAGATTAC CAGGGCGTTT TCCCCCTGGG AAGCTNCCTT CGTGCGCTCT CCTGTTCCCG 600 

AACCTGGCCG GTTTAACCGG GATACCNGNT CGGCCTTTTN TCCCCTTNGG GGGAANCCTT 660 

GGGGNTTTTN GNAAAANGCT AAGGGTT 687 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 701 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GCTCGTACCG GGGATCTNNN ANATCGACTG CAGCATGCAA GCTTGGCGTA ATCATGGTCA 60 

TAGCTGTTTC CTGTGTGAAA TTGTTATCCG CTCACAATTC CACACAACAT ACGAGCCGGA 120 

AGCATAAAGT GTAAAGCCTG GGGTGCCTAA TGAGTGAGCT AACTCACATT AATTGCGTTG 180 

CGCTCACTGC CCGCTTTCCA GTCGGGAAAC CTGTCGTGCC AGCTGCATTA ATGAATCGGC 240 

CAACGCGCGG GGAGAGGCGG TTTGCGTATT GGGCGCTCTT CCGCTTCCTC GCTCACTGAC 300 

TCGCTGCGCT CGGTCGTTCG GCTGCGGCGA GCGGTATCAG CTCACTCAAA GGCGGTAATA 360 

CGGTTATCCA CAGAATCAGG GGATAACGCA GGAAAGAACA TGTGAGCAAA AGGCCAGCAA 420 

AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG CTGGGCGTTT TTTCCATAGG CTCCGCCCCC 480 

CTGANGAGCA TCANAAAAAT CGANGCTCAN GTCANAGGTG GCGAAACCCG ACAGGNCTAT 540 

TAAAAGATNC CCAGGCGTTT TCCCCCCTGG GAAGCTCCCT CGTGGGGCTC TCCTGGTTNC 600 

GGNNCCCTGN CCGGNTTACC GGGGATAANC TTGTTCCGGN CTTTNTCCCC TTCNGGGAAA 660 

ANGGTGGGGG GTTTTNTNNA AAAGGCTCAA AGGCTGGTAN G 701 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

GNTNTAGNNN GGNCTAANNG GCGGGGAAAT CGAGCTCGGT ACCCGGGGAT CCTCTAGAGT 60 

CGACCTGCAG GCATGCAAGC TTGGCGTAAT CATGGTCATA GCTGTTTCCT GTGTGAAATT 120 

GTTATCCGCT CACAATTCCA CACAACATAC GAGCCGGAAG CATAAAGTGT AAAGCCTGGG 180 

GTGCCTAATG AGTGAGCTAA CTCACATTAA TTGCGTTGCG CTCACTGCCC GCTTTCCAGT 240 

CGGGAAACCT GTCGTGCCAG CTGCATTAAT GAATCGGCCA ACGCGCGGGG AGAGGCGGTT 300 

TGCGTATTGG GCGCTCTTCC GCTTCCTCGC TCACTGACTC GCTGCGCTCG GTCGTTCGGC 360 

TGCGGCGAGC GGTATCAGCT CACTCAAAGG CGGTAATACG GTTATCCACA GAAATCAGGG 420 

GATAACGCAG GGAAAGAACA TGTGAGCAAA AGGCCCAGCA AAAGGCCAGG AACCCGTAAA 480 

AAGGCCGCGT TGCCTGGCGT TTTTCCATAG GCTCCGCCCC CCTTGACGAG CAATCACAAA 540 

AATCGACGCT CAAAGTCAAG AGGTGGCGAA ACCCCGACAG GGACTTATAA AGATACCCAG 600 

GCCGTTTCCC CCTGGAAGCT CCCCTCCGTG CGCTTCTCCT TGGTTCCCGA CCCTGCCGCT 660 

TTACCNGGAT NCCTGTCCGC CCTTTTNTCC CTTTCNGGNA ACCGGGCGCT TTTTTTT 717 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: both 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



NNNNCNNNNG GCTGANAGCG ATAAATCGAG CTCGGTACCC GGGGATCCTC 60 

CTGCAGGCAT GCAAGCTTGG CGTAATCATG GTCATAGCTG TTTCCTGTGT GAAATTGTTA 120 

TCCGCTCACA ATTCCACACA ACATACGAGC CGGAAGCATA AAGTGTAAAG CCTGGGGTGC 180 

CTAATGAGTG AGCTAACTCA CATTAATTGC GTTGCGCTCA CTGCCCGCTT TCCAGTCGGG 240 

AAACCTGTCG TGCCAGCTGC ATTAATGAAT CGGCCAACGC GCGGGGAGAG GCGGTTTGCG 300 

TATTGGGCGC TCTTCCGCTT CCTCGCTCAC TGACTCGCTG CGCTCGGTCG TTCGGCTGCG 360 

GCGAGCGGTA TCAGCTCACT CAAAGGCGGT AATACGGTTA TCCACAGAAT CAGGGGATAA 420 

CGCAGGAAAG AACATGTTGA GCAAAAGGCC AGCAAAAGGC CAGGAACCGT AAAAAGGCCG 480 

CGTTTGCTGG CGTTTTTCCC ATAGGCTCCG CCCCCCTTGA CGAACCATCA CAAAAATCGA 540 

CGCTCAATTC AGAAGTTGGC GAAAACCCGA CAGGACTAAT AAAGATACCC AGCGTTTCCC 600 

CCCCTGGAAA CTCCCCTCCG TTGCGCCTCT CCCTGTTCCC GAACCTTGCC CGCTTACCGG 660 

GAATACCTTG TCCNCCTTTT CTCCCCTTCC GGGAANCGTT NGCGCCTTTC CCC 713 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

Gly Ser Arg Leu Val Asp Arg Gln Cys Gly Leu Phe Pro Asn Arg Thr 
1 5 10 15 

Gln Asn Pro Arg Ser Ser Gln Trp 
20 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Glu Pro Cys Glu Gly Leu Arg Gly Gly His Arg Cys Ala His Leu Asp 
1 5 10 15 

Cys Leu Gln Asp Leu Gln Arg Lys Ala Arg Arg Ser Glu Arg Arg Asn 
20 25 30 

Ala Thr Gly Trp Lys Asp Gly Glu Leu Leu Tyr Asn Ile Arg Cys His 
35 40 45 

Thr Val Arg Ser Phe Cys Glu Thr Trp Asn Thr Gly 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

Arg Ser Arg Lys Asp Asp Tyr Gln Leu Phe His Thr Val Ser Lys Gly 
1 5 10 15 

Ala Lys Leu His Pro Ala Gly Cys Cys Arg Gly Lys Arg Glu Gly Leu 
20 25 30 

Arg Gln Asp Ser Leu Trp Lys Lys Lys Arg Tyr Ser Pro Ala His Gly 
35 40 45 

Lys Gly Gln Glu His Pro Val Arg Arg Arg Lys Asn Arg Asn Lys His 
50 55 60 

Pro His Ser Gly Asn Cys Gly Arg Tyr Asn Lys Ile Gly Tyr Asp Arg 
65 70 75 80 

Tyr Arg Arg Gly Ala Glu Lys Lys Lys His Glu Ile Gln Asn Asp His 
85 90 95 

Ser Gly Ser 



(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Arg Thr Gly Leu Arg Gly Ser Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Gly Lys Arg Arg Thr Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Ser Gly Glu Glu Gln Asn Asp Lys Cys Gly Glu Thr Leu Cys Ala Ser
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

His Lys His Arg Lys Lys Leu Val Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Ile Gln Thr Gly Gly Ser Ser Val Arg Ile Ile Pro Lys
1 5 10

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Asn Ser Glu Ser Ser Leu Ile Ser Val Val Met Arg Thr Leu
1 5 10

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Arg Pro Ser Arg Arg Ala Ser Met Cys Thr Pro 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

Leu Pro Pro Gly Ser Thr Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
Ser Gln Lys Lys



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

Thr Lys Lys Cys Asp Gly Leu Glu Arg Trp 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Tyr Thr Val Ser His Arg Thr Val Phe Leu 
1 5 10

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Asp Leu Glu Tyr Arg Leu Lys Lys Gln Lys Arg
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Leu Ser Ala Ile Ser His Cys Ile Gln Arg Cys Glu Ala Thr Ser Ser
1 5 10 15

Arg Leu Leu Gln Arg Gln Lys Arg Arg Ala Thr Ser Gly Leu Ser Leu
20 25 30

Glu Glu Lys Glu Ile Phe Pro Ser Ser Trp Gln Gly Thr Arg Thr Pro
35 40 45

Ser Pro Lys Ala Lys Glu Ser Gln
50 55 

(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: peptide



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Thr Pro Pro Phe Arg Glu Leu Arg Gln Ile
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

Glu Lys Glu Thr 
1 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: not relevant

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Ser Phe Arg Phe Met Thr Asn Trp Ser Ser Arg Phe Pro Met Arg Lys 
1 5 10 15 

Lys Lys Asn 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

Asn Ser Leu Cys Leu Leu Arg Leu Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Ala Ser Glu Leu Ala Gly Leu 
1 5 



